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Abstract: 

We present an advanced interpolation method for estimating smooth spatiotemporal 
profiles for local highway traffic variables such as flow, speed and density. The method 
is based on stationary detector data as typically collected by traffic control centres, 
and may be augmented by floating car data or other traffic information. The resulting 
profiles display transitions between free and congested traffic in great detail, as well as 
fine structures such as stop-and-go waves. We establish the accuracy and robustness of 
the method and demonstrate three potential applications: 1. compensation for gaps in 
data caused by detector failure; 2. separation of noise from dynamic traffic information; 
and 3. the fusion of floating car data with stationary detector data. 

1 INTRODUCTION 



A detailed picture of speed and flow is essential for understanding flow breakdown on 
highways and the dynamics of congestion. In particular, highway traffic may not be un- 
derstood by time series data alone but rather we must consider its structure in space and 
time j ointly (|Treiber et all l200d : iBertini et all . l2005al lbh . Let t and x denote respectively 
time and distance driven down the highway. We may thus introduce spatiotemporal pro- 
files for the macroscopic variables velocity V{x, t), flow q(x, t) and density p(x, t). Where 
the quality of data permits, these quantities may be displayed as colour charts or land- 
scapes which exhibit rich structure such wave propagation, phase transitions etc. To 
continue the analogy with particle physics, we may speak of the traffic state, by which 
we mean the classification of the traffic at any one time accordin g to its spatiotemporal 
struc ture, e.g., as free flow, synchronized flow, stop -and-go waves feerner and Rehborn . 
199fih . or in terms of more detailed classifications (jHelbing et all Il999l . I20091 ). 



Unfortunately, highway traffic data comes from many heterogeneous sources (jvan Lint and Hoogendoorn 



200fll ). Most simply we have stationary detector data (SDD) collected by fixed infras- 
tructure, which typically consists of inductance loops buried in the surface of the road. 
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In their usual operation, the loops count vehicles and estimate their lengths and speeds, 
which are then sent to regional traffic control centres in the form of 1-minute aggregate 
data. However, modern communication systems have sufficient bandwidth to carry full 
Individual Vehicle Data (IVD) to the control centre, and this data may lead to advances 
in incident detection algorithms, for example. More recent stationary detection systems 
operate in a similar fashion to inductance loops, but are based on magnetometers, and 
radar / laser / infrared devices installed on bridges. 

Other traffic data is not provided at f i xed points in space, for example floating-car data 
(FCD ) from GPS devices dFastenrathl. 1 19971: llvan and Sethil . 1998; iHerrera and Bayed . 
20081 ) , or floating-phone data (FPD) (jCaceres et aI.Ll2008h . Yet further data is produced 
on an event-oriented basis such as messages from the police. 

Two problems with such highway traffic data sets are: 

1. Sparseness. Each source of data individually may be insufficient to determine the 
traffic state. For example, the distance between consecutive stationary detectors 
may be too great to infer what is happening between them. This problem is com- 
pounded by detector failure. In the case of FCD, it is a relatively small proportion 
of vehicles that report GPS data. 

2. Noise, of several different types, for example: 1. measurement error committed by 
detectors; 2. sampling errors in aggregate data due to small numbers (a problem 
in low flow conditions); and 3. heterogeneity of the driver / vehicle population (so 
that FCD for one vehicle may not be at all representative of those around it). 

However, both sparseness and noise can be addressed to some degree by combining data 
from either similar or heterogeneous sources, and the focus of this paper is a method 
which can fuse heterogeneous data in order to reconstruct smooth spatiotemporal profiles 
for local traffic variables such as the speed. 



Ou r method is based on the adaptive smoothing method (ASM) (ITreiber and Helbing , 
2002J). Whereas the original ASM was restricted to SDD only, here we consider the 



generalized adapt ive smoothing method (GA SM) which is extended to cope with hetero- 
geneous sources ( Resting and Treiber . 20081 ) . In particular, the GASM can interpolate 
locally inconsistent data: for example, an FCD speed measurement need not equal a 1- 
minute aggregate SDD with the same (x, t) coordinates, but the GASM can nevertheless 
smoothly combine these disparate measurements. 

Both the ASM and GASM interpolate between data points (thereby tackling the 
problem of incomplete data coverage) and eliminate high-frequency noise while pre- 
serving most of the relevant dynamic information. However, their chief novelty is that 
they surpass non-specialized smoothing methods by incorporatin g some traffic physics, 



pnys 



in the form of well-understood wave propagation characteristics (IHelbing et al.1 . |2009). 
Specifically, it is known that in congestion, small perturbations to the traffic prop- 
agate upstream at about 15 k m/h, whereas in free - flow t hey propagate down s tream 
at the speed of the vehicle s ( Kerner and Rehborn . 19961 : Cassidv and Bertini . 19991 : 
Schonhof and Helbind . lioOTT ). 
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The chief contributions of this paper are an investigation of the robustness and ac- 
curacy of the GASM and a demonstration of some potential applications, including the 
fusion of FCD and SDD. The organization is as follows. In Sec. El we formulate the 
GASM and discuss calibration and validation issues based on data from the German 
autobahn A9 and the English M42 motorway. Since the detector coverage on the M42 is 
extremely dense (loop detector spacings of 100 m), the validation (Sec. 12. 5p will be based 
on real (not simulated) traffic dynamics which can be considered as completely known 
for our purposes. In Sec. O we propose possible applications of this method such as 
bridging data gaps (Sec. 13. IT) , separating noise from information (Sec. 13.2] ). and fusion of 
floating-car and stationary detector data (Sec. 13.3]) . Finally, the method and the results 
will be briefly discussed in Sec. HI 



2 GENERALIZED ADAPTIVE SMOOTHING METHOD 



We now present the details of the generalized adaptive smoothing method (GASM) which 
performs two-dimensional interpolation to reconstruct the spatiotemporal traffic state 
from discrete traffic data. Speed data x>i measured at known locations xi and times ti 
are obtained from either SDD or FCD, or on an event-oriented basis, and are combined 
to produce a smooth velocity field as a function of continuous space and time. With 
minor modifications, flow and other spatiotemporal variables may also be reconstructed. 

The GASM is based on two-dimensional interpolation in space and time (Sec. 12. ip 
using smoothing kernels. However, in contrast to a conventional isotropic filter, the 
method incorporates the known characteristic velocities of information propagation in 
free and congested traffic (Sec 12. 2p . by skewing the principal axes of the smoothing 
kernel. The switch between free and congested traffic is then managed by a nonlinear 
adaptive speed filter (Sec. 12.3]) . We then demonstrate the effectiveness of the GASM by 
comparing it with conventional smoothing with an isotropic kernel (Sec. 12. 4p . Finally, 
in Sec. 12.51 we validate the GASM using M42 data where the inductance loop system 
is over-specified. The approach is to apply the GASM to a subset of the inductance 
loop data and re-construct the velocity field at the positions of detectors which have not 
been used in the interpolation. The accuracy of the GASM may then be established in 
comparison to the detector data which is regarded as the ground truth. 

2.1 Conventional Spatiotemporal Interpolation 

Our inputs are a set of discrete data points {xi,ti,Vi}, i = 1, . . . , n, and the interpolation 
task is to derive from them a smooth velocity field V(x,t) in a given spatiotemporal 
interval. The broad approach is to employ the convolution 




(1) 
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where the smoothing kernels (pi(x,t) are sufficiently localized functions that decrease 
with increasing \x\ or \t\, and we define the normalization factor by 

J^(x f t) = =- n r. (2) 

i 

This formulation allows for different types of data point to use different kernels, but 
to simplify matters we shall usually assume that the kernels are identical and take the 
symmetric exponential form 



(x,t) = exp 



\x\ \t\ 
— + — 
a t 



(3) 



although a bivariate Gaussian would also be suitable. The smoothing kernel acts as a 
kind of low-pass filter, and the positive constants a and r define characteristic 'widths' 
for the spatial and temporal smoothing respectively, so that features with finer scales 
tend to be smoothed out. 

Suitable values for a and r are of the order of half the typical distance between 
neighboring data points. For example, in a typical situation where an inductance loop 
system provides one-minute aggregate data at 2 km intervals, we choose r = 30 s and 
a = 1km (see Sec. 12. 5p . However, for a stronger reduction of noise, larger smoothing 
widths may be chosen (e.g., r up to 2min). 

2.2 Traffic-Adaptive Smoothing 

A serious challenge in traffic data is that the typical scale of some traffic patterns, such as 
the wavelength of stop-and-go waves, is (at 1-2 km) similar to the spacing of stationary 
detectors. Consequently, important dynamical features may be lost in the interpolation 
process, and even entirely spurious patterns may be reconstructed (see Fig. [2]). 

To enhance the resolution of the filter, we may use established facts concerning the 
propagation of information in traffic flow. The 'propagation velocity' is that at which 
small perturbations to the traffic flow are propagated, and in traffic theories based on 
hyperbolic partial differential equations, it corresponds to the characteristic wave velocity 
given by the gradient of the equilibrium flow-density curve (the so-called 'fundamental 
diagram'). It is well-known that: 

1. In free traffic, perturbations move downstream (i.e., i n the direction of traffic flow) 
( Kerner and Rehborn . 19961 ; Schonhof and Helbine . 20071 ) and the characteristic 



propagation velocity Cf ree is similar to the local average speed of the vehicles. 

2. In congested traffic, however, perturbations travel against the movement of the 
vehicles (i.e., upstream). Moreover, the characteristic propagation velocity c cong 
is usually in the vicinity of -15 km/h. This value is well-established as the typical 



veloc ity of stop-and-go waves (jKerner and Rehbornl . Il996l : ISchonhof and Helbing , 



2007), but seems to apply more generally to nearly all information propagation in 



congested traffic. 
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To apply these facts, we skew the conventional isotropic smoothing kernel ([3]) in order 
to obtain the anisotropic interpolation formulae 



Vir ee {x,t) 
^cong(-E> t) 



1 ^ — ^ / X — X{ 
—r 2_^<P\ X - Xi,t-ti | r, 



Af(x,t) 

1 

M{x,t) 



1 v-^ / X — Xi 
-r } 0( X - Xi,t-ti | r, 



--cong 



(4) 
(5) 



for free and congested traffic respectively. In effect, these new filters correspond to 
smoothing in preferred directions in the (x, t) plane (see Fig. [I]), based on the propagation 
velocities Cf ree and c con g- Note that the conventional isotropic smoothing corresponds to 
the limit Cf ree = c CO ng — ► oo. 

In practice, we take c con g ~ — 15km/h, whereas we find that Cf ree ~ +70km/h 
gives good results in a highway context. The use of ju st two characteristic propaga- 
tion velocities is consistent with kinematic wave models ( Cassidy and Windower . 19951 : 
Cassidv and Bertinil. I1999H and t he assumption of a piecewise linear ('triangular') fun- 
damental diagram ~(|Newelll . Il993h . 




Figure 1: Illustration of the smoothing kernels for free and congested traffic. The inclination 
angles capture the different characteristic velocities Cf ree and c CO ng- In particular, perturbations 
propagate upstream (against the driving direction) in congested traffic. 



2.3 Nonlinear Adaptive Filter 

We must now construct a single smoothing filter which combines the formulae for free 
and congested traffic. To this end, we define 

V(x,t) = w(x,t) V cong (x,t) + [1 - w(x,t)\ V iicc (x,t), (6) 
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Parameter 



Value 



Smoothing width in space coordinate a Ax/2 

Smoothing width in time coordinate r At/2 

Propagation velocity of perturbations in free traffic Cf rcc 70km/h 

Propagation velocity of perturbations in congested traffic c cong -15km/h 

Crossover from congested to free traffic Vthr 60km/h 

Transition width between congested and free traffic AV^ 20km/h 



Table 1: Parameters of the adaptive smoothing algorithm with typical numerical values used 
in this paper. The spatial and temporal smoothing widths are chosen as half of the average 
inter-detector spacing Ax and sampling time St, respectively. 



where the weight factor w(x,t) = W (Vf Tee (x,t),V cong (x,t)) controls the superposition 
of the free and congested velocity fields dU),©. We require W « at high speeds and 
W ~ 1 at low speeds, and thus we use the smooth s-shaped function 



W{V iree ,V cong ) = - 



1 + tanh 



Vthr - mm(V ivee ,V c 



cong, 



AV 



(7) 



Here, the 'predictor' min(Vf ree , V cong ) is defined such that the patterns of congested traffic 
are better reproduced by the resulting non-linear filter than that for free traffic. The 
threshold between free and congested traffic is defined by Vthr while the transition width 
is determined by AV. Typical parameter values are given in Tabled! 

2.4 Sensitivity Analysis 

For illustration, we apply the GASM to a small portion of one-minu te aggregate detector 
data from the South - boun d A9 autobahn near Munich, Germany ( Treiber et al. . 2000l : 



Treiber and Helbinal . l2002h . We consider data from 9 detectors spread over 14km of 



highway during 4 hours of a busy morning on which there were pronounced stop-and-go 
patterns. Here, as throughout the paper, 1-minute aggregate SDD points also incorpo- 
rate a flow-weighted aggregate of speed measurements across the lanes of the highway. 
In fact, in congested traffic, the speed variance between lanes tends to be rather small, 
and aggregation across lanes thus helps reduce sampling noise. 

See Fig. [21 which compares the performance of GASM with the standard isotropic 
filter (pQ). Here GASM uses the standard parameters from Table [TJ The positions of the 
detectors which are used in reconstruction are indicated by horizontal lines. Compare 
Figs. EJa,b): we may observe that GASM is able to resolve individual stop-and-go waves 
when isotropic smoothing is not able to identify the pattern. As a more difficult chal- 
lenge, in Figs. [2jc,d) the smoothing algorithms are compared when data from only 6 out 
of the 9 detectors is used. In this case isotropic smoothing identifies spurious patterns 
although GASM continues to reconstruct stop-and-go waves correctly even though the 
spatial resolution of the input data is very poor. 

Our experience is that GASM performs better than isotropic smoothing in all of the 
data sets that we have tried. Moreover, we have found that the parameter choices in 
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Figure 2: Generalized Adaptive Smoothing Method (a,c) versus conventional isotropic interpola- 
tion (b,d) applied to loop detector data from the German autobahn A9 (near Munich, direction 
South). In (c,d), only six out of the nine detectors are used in reconstruction. Conventional 
interpolation may neglect important spatiotemporal features or even identify spurious structure. 
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Figure 3: Sensitivity of the generalized adaptive smoothing method to variation in its parameters 
(cf. Fig. [2j full details in the main text). 



TableHJare robust and do not need re-tuning for each new application. As an illustration, 
we apply GASM to the same data as for Fig. [21 but with pathological changes to the 
algorithm parameters. See Fig. El 

In Fig. E|a), we change the propagation velocities Cf ree from 70km/h to 200km/h, and 
Ccong from — 15km/h to — 12km/h. In contrast, in Fig. EJb), we modify the nonlinear 
filter, by reducing the transition width from AV = 20km/h to 5km/h, and the crossover 
threshold from Vthr = 60km/h to 45km/h. To assess the robustness, these plots should 
be compared with the corresponding results for GASM with standard parameters and 
for conventional isotropic smoothing, in Figs. EJc,d) respectively. 

On visual inspection, the quality of the GASM depends only weakly on its parameters 
and in all cases surpasses isotropic smoothing at this spatial resolution. We have found 
that the most sensitive parameter is the propagation velocity c cong for congested traffic, 
since too low or too high values result in step-like artefacts in the reconstruction. How- 



ever, in practice, c mng varies very l ittle from situation to situation (jKerner and Rehbornl . 



19961 : ISchonhof and Helbind . 120071 ) 



In conclusion, we have yet to perform a formal optimisation of the GASM parame- 
ters. But our experience is that GASM reconstructs traffic patterns robustly with the 
parameter choices of Table [Hand stationary detector spacings up to about 3 km. Near a 
bottleneck, this spacing should preferably be reduced so that the stationary downstream 
jam front can be accurately positioned. 

2.5 Validation 

For validation of the GASM, we consider a 15-kilometer long section of the North-bound 
M42 motorway near Birmingham, England. As part of the English Highways Agency's 



Activ e Traffic Management system ([English Highways Agency's Active Traffic Management system homepagd . 



20091 ) this highway has been equipped with an almost unprecedented coverage of indue 



tance loop detectors, with a typical nominal spacing of 100 m. In consequence, spa- 
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tiotemporal patterns may be identified without any interpolation process at all. Thus in 
effect, the ground truth is directly available and we may use it to definitively evaluate 
the performance of interpolation algorithms. 

As a test case, we take data from Friday January 11, 2008, and in Fig. 2] we display a 
scatter plot of 1-minute aggregate lane-average speed, showing a complex spatially ex- 
tended pattern incorporating several bottlenecks and large amplitude stop-and-go waves. 
Close examination of this raw data largely supports the use of just two distinct propa- 
gation velocities Cf ree and c con g, validating the overall GASM technique. 




Figure 4: Reference situation used for validation. The inter-detector spacing is 100 m (40 m in 
the vicinity of x — 12 km). The data are visualized as a spatiotemporal scatter plot. Each data 
point corresponds to the local speed aggregated over all lanes and over 1 min. No further data 
processing has been applied. 

For a quantitative investigation, we apply the GASM with standard parameters to 
input data chosen from just a small selection of the available detectors. The interpolated 
field V(x, t) is then compared to speed data at detectors which are half way between 
those whose data has been used in the reconstruction. For example, at a spacing of 1 km 
corresponding to Fig. [5ja), the error measure is based on the detectors at x = 2.5 km, 
x = 3.5 km and so forth. Fig. [5] displays the reconstructed traffic states with reduced 
sets of loop detectors. In summary, the most important features are identified even when 
the detector spacing is increased to 4 km. 

Fig. [6] presents RMS errors of the reconstructed velocities, averaged over all applicable 
test sites, as a function of the detector spacing. To assess the quality of the GASM, we 
compare it with conventional isotropic smoothing, that is, setting Cf ree and c con g to oo. 
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For a given detector spacing, the quality of the GASM can be compared to that of 
conventional smoothing when about twice as many detectors are available. Specifically, 
when using the GASM, the quality of the reconstruction at a detector spacing of 2.5 km is 
comparable to that of isotropic smoothing when detectors are available every kilometer. 





Figure 5: Reconstruction of the reference situation of Fig.[4]by the adaptive smoothing method 
(standard parameter set) applied on reduced data sets with detector spacings between 1 km and 
4 km. The locations of detectors whose data has been used in the reconstruction are indicated 
by horizontal lines. 



3 APPLICATIONS 

We now give three illustrative examples of potential applications for the GASM: 1. 
Compensation for detector failure (Sec. 13. ip ; 2. Separation of noise from true traffic 
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Figure 6: Reconstruction of time series for speed using the adaptive smoothing method (lower 
curve) and conventional smoothing (upper curve). Shown is the RMS velocity deviation with 
respect to the actual measurement, averaged over all detector sites that are half way between 
the detectors available to the reconstruction methods. The deviation is plotted as a function of 
the spacing of the available detectors. 



dynamics (Sec. I3.2|) : and 3. Fusion of floating car data (FCD) with stationary detector 
data (SDD) (Sec. El . 



3.1 Compensation for Detector Failure 

Common operational problems with inductance loop systems include 

• a temporary but simultaneous failure of all detectors covering a certain road sec- 
tion, usually due to a failure in the communications sub-system; or 

• the permanent failure of one or a few detectors, often due to their installation not 
meeting specified standards. 

In either case, the resulting SDD has 'gaps', and we may estimate the data that is 
missing by interpolating the data that is extant. 

Our example is of the former type and is taken from the South-bound A5 autobahn 



near Frankfurt, Germany, on the morning of August 6th, 2001, see Fig. Hand (jSchonhof and Helbingi . 



20071 ) for full details of the site. There is a 20min breakdown of all detectors between 
08:59 and 09:19. During this period, the detection system becomes 'frozen' for 10 min- 
utes and then records zero speed for a further 10 minutes. Fig. E|a) displays the resulting 
spurious spatiotemporal field recovered by the GASM. 

However, additional error bits signal the detector failure and the correct approach is 
thus to eliminate the period 08:59 to 09:19 and apply the GASM to what remains, see 
Fig. E^b). Observe that the GASM can bridge the gap in the data in a mostly natural 
way, although an artefact is introduced in the transition from congested to free traffic at 
the bottleneck at kilometer 482. This is because the GASM is well-tuned to reconstruct 
missing information in structures with velocities Cf ree and c con g, but not to reconstruct 
stationary structures. 
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To further clarify the properties of the reconstructed state, Figs. E^c-f) display time 
series of speed at two selected detectors. Each plot shows the original detector data, and 
Figs. EKc,e) compare it with the erroneous GASM reconstruction whereas Figs. Efd,f) 
compare with the correct reconstruction with the erroneous data removed. Notice that 
the reconstructions in Figs. [7^d,f) do not correspond to a simple interpolation in time, 
because of the way in which the GASM incorporates spatial information from other 
detectors. 

Further investigations have shown that temporal data gaps of up to 30 min and spatial 
gaps up to 3 km can typically be compensated for. In the latter case, however, the result 
depends strongly on the position of the failed detectors relative to bottlenecks. 



3.2 Elimination of Noise 

A significant problem in SDD is noise due to the time interval (typically 1 minute) 
over which raw data is aggregated by the road-side signal processing subsystems. In 
particular, when the traffic flow rate is low, the aggregate may be constructed from only 
a handful of individual vehicle observations, which themselves are prone to measurement 
error, and in consequence it is subject to gross statistical sampling error. The two 
situations of concern are dense queuing traffic and sparse high-speed traffic, and the 
latter case is subject to the additional problem that the variance in individual vehicles' 
true velocities may be very high. 

A common real-time application is to use speed and flow thresholds from single de- 
tectors to trigger queue warnings on Variable Message Signs immediately up-stream of 
a stop-and-go wave. It is thus crucial that stop-and-go waves are identified correctly 
and that alerts are not triggered by statistical outliers. To reduce the effect of sampling 
error, one may aggregate data over wider time intervals. However, there is not a clear 
separation of time scales and sharp features such as the boundaries of the waves may get 
lost if the aggregation window is broadened too far. Hence in general, it is non-trivial 
to separate noise from dynamics in detector time series. 

For illus tration, we again con s ider t he South-bound A5 autobahn near Frankfurt, 
Germany ( Schonhof and Helping . 20071 ). with data now taken from July 9, 2001, on 



which there were a number of strong stop-and-go waves. See Fig. [81 which displays 
the time series of speed from a single stationary detector and compares conventional 
temporal smoothing using the kernel exp(— \t\/r) with r = 60s and the GASM with 
standard parameters. 

In summary, the GASM is much more effective than simple temporal filters at reducing 
noise whilst retaining structure. In effect, the GASM uses spatial information, by blend- 
ing the time series of nearby detectors, to enhance vehicle counts without broadening 
the temporal averaging window. The difference between noise and real traffic oscilla- 
tions can thus be identified, because traffic information is correlated between detectors 
whereas noise is not. 
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Figure 7: Detector failure between 08:59 and 09:19 (South-bound A5 autobahn on August 6th, 
2001). Vertical lines in all plots indicate the period of failure. Plots (a,c,e) show results when 
GASM incorporates erroneous data and (b,d,f) shows the correct results when GASM omits the 
erroneous data and fills the gap by interpolation. 
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Figure 8: Time series for a stationary detector at location x — 473.0 km on the German 
Autobahn A5-South (see Fig. [7] for location — but this data is taken from a different day). 
The original data (thick solid line) is compared with the reconstruction at this position using 
the generalized adaptive smoothing method (thin solid), and with conventional smoothing (thin 
dashed line). 




3.3 Fusion of Floating Car Data 



As Intelligent Transport Systems are rolled out further across our trunk road networks, 
savings in infrastructure costs may be achieved if inductance loops are installed much 
more sparsely than is typical for the busiest highways. However, a detailed picture 
of traffic flow may still be derived if we also have access to some FCD , for exam- 
ple fro m high-end GPS services (such as the IT IS (|lTIS Homepage! . I2009T ) or Traffic- 
master ( TrafHcmaster - Intelligent Driving . 2009I ) systems in the United Kingdom) or 
from data from the m obile phone networks, that is so-called floating-phone data (FPD) 
( Caceres et al. . 2008I ) which is incorpo rated into high-end navigation services such as 
TMCpro in Germany (jTMC-prol . lionih . 

As a final application of the GASM, we consider the fusion of FCD and SDD. Our il- 
lustration uses s ynthetic data genera ted by microsimulation based on the Human Driver 



Model (HDM) (ITreiber et all I2006T). wh ich is a refinement of the well-known Intelli- 
gent Driver Model (ITreiber et all In the HDM, the reaction-time and multi- 
anticipatory effects have been calibrated so that the quantitative details (wavelength, 
amplitude etc.) of spatiotemporal traffic patterns are reproduced. 

Our specific example simulates a 12 km section of a single-lane highway without junc- 
tions over more than 2 hours. Rush-hour conditions are simulated by a peak in the in- flow 
(traffic demand) at x = km. To induce traffic patterns, a flow-conserving bottleneck is 
applied at x = 11km by increasing the car-following model's time-headway parameter 
at this point. Four stationary detectors are then simulated at x = 2,8,10,12 km, and 
just 10 vehicles out of a total of 2,750 in the simulation are selected randomly to provide 
speed data at 10 s resolution. These SDD and FCD are presented in the scatter plot 
in Fig. [9l We then reconstruct spatiotemporal velocity profiles by applying the GASM 
to different combinations of the SDD and FCD data. See Fig. [TU1 Here Fig. fTUlfa) dis- 
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Figure 9: Scatter plot of simulated Floating Car and Stationary Detector speed data. 



plays the ground truth profile derived by using FCD from all vehicles in the simulation. 
Note that for the reconstruction based on SDD alone, not only interpolation but also 
extrapolation is required at the extreme upstream and downstream locations. 

In Fig. [TUlfd) SDD and FCD are combined using the same kernel function, except that 
each FCD point has been given double the weight of an SDD point. In practice one 
would need to experiment with this relative weighting and / or tune it to the particular 
application under consideration. 

In summary, the SDD reconstruction outperforms FCD in resolving the stationary 
downstream front at the bottleneck at x = 11km, whereas the FCD reconstruction 
outperforms SDD in resolving stop-and-go waves. The reconstruction using both FCD 
and SDD together combines the best properties of either individual reconstruction. This 
example shows that the incorporation of even very small amounts of FCD (one vehicle 
in every 200 or 300) can significantly improve the reconstruction of the traffic state. 

4 DISCUSSION 

In summary: in this paper we have demonstrated how the Generalised Adaptive Smooth- 
ing Method (GASM) can solve practical problems in the analysis of highway traffic pat- 
terns. In particular it can 1. compensate for gaps in data caused by detector failure; 
2. reduce noise due to either sampling or measurement error (whilst compromising res- 
olution less than a purely temporal filter); and 3. fuse heterogeneous data. Although 
we have focussed on the reconstruction of the spatiotemporal velocity field, other fields 
(e.g., flow) may also be reconstructed: to achieve this, use alternative data in the linear 
formulae (j4]) and ([5|) but retain unchanged the dependence on velocity in the nonlinear 
component ([6]), (J7|). 

We have established that the performance of the GASM is quite robust to changes in 
its parameters. In consequence, it does not need tuning for each new scenario, and for 
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Figure 10: Spatiotemporal speed profiles: (a) ground truth; (b) using Stationary Detector Data 
only; (c) using Floating Car Data only; (d) combining Stationary Detector and Floating Car 
Data. 



freeway applications, the values from Table[T]may be used with confidence. A substantial 
benefit of the method is that it is not based on any one specific model of highway traffic 
and therefore may be used in the objective benchmarking of different models. On the 
other hand, a fine-tuning of the parameter c con g may be used for research purposes to 
determine the propagation velocity of congested traffic patterns — and when formulated 
in r.m.s. error this method is related to the standard one based on the cross- correlation 
of time series from different stationary detector locations ( Zielke et al. . 20081 ) . 



However, the results here represent only a first analysis and a detailed calibration and 
validation study remains for future work. We have yet to carry out a formal optimisation 
of the GASM parameters (e.g., using over-specified inductance loop data, as from the 
English M42 motorway), but there are other ways in which the details of the formulation 
that we have presented may be varied, and they should be analysed systematically using 
a variety of data from different highways and different countries. For example, we might 
experiment with different kernel functions (and note in practice these are implemented 
with a finite range 'cut-off' for computational efficiency). Also, the relative weighting 
and parametrisation of the kernels applied to different types of data point (individual 
vehicle or aggregate) needs investigation, and if possible a rigorous grounding in Bayesian 
statistics. 

In fact, in either wholly congested or wholly free traffic, the GASM is linear and 
thus its performance may be compared definitively to a more general space of linear 
filters tuned for any one highway scenario. However, in practice, it is not so easy to 
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generalise the nonlinear switch ©, © which is key to the reconstruction of complete 
traffic patterns. 

Unlike the ASDA/FOTO method (jKerner et alJ . l2004l ), the GASM in its present form 
is not suited to online applications. In principle, our method might be used to extrapolate 
data from the past into the near future, and thus form the basis of a short-term fore- 
casting algorithm. In fact, stop- and- go waves are forecasted very well by this approach. 
The problem is the behaviour of the GASM at bottlenecks where there is commonly a 
congestion pat t ern w hose downstream front is stationary with respect to the highway 
(jHelbing et all 120091 ). At present, the GASM does not distinguish different congested 



flow regimes and so in extrapolation to the future, it will erroneously propagate this 
structure away from the bottleneck at velocity c con g- The correction of this problem 
remains for future work, as does the incorporation of other aspects of traffic physics 
which may be useful in forecasting, such as the conservation of vehicles. 

From the point of view of infrastructure design and specification, it is interesting to 
investigate the optimal positioning of a fixed number of stationary detectors, in order for 
the GASM to yield the best reconstruction of the velocity field. To answer this question 
we should again return to the performance of the method in an environment like the M42 
motorway where there is a redundancy of detectors. We suspect that detectors should 
be placed more densely near bottlenecks, but the optimal configuration is unknown. 

Finally, the increasing availability of Floating Car Data (FCD) may allow much coarser 
stationary detector data (SDD) in future. A systematic investigation of the effects of av- 
erage SDD spacing, SDD aggregation time, FCD penetration rate, and FCD aggregation 
time / event-based protocols remains for future work. 
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